Systems and methods for short identifier behavioral analytics

ABSTRACT

Embodiments relate to systems, devices, non-transitory computer-readable storage media, and computer-implemented methods for performing behavioral analytics using short identifiers by obtaining historical event data for multiple events and multiple short identifiers, where each event is associated with a short identifier, selecting a first event location associated with a first event, selecting a second event location associated with a second event that is geographically dispersed from the first event location, determining identifier collisions based on short identifiers associated with the first event location and short identifiers associated with the second event location in the historical event data, determining an average percentage of identifiers collisions based on the identifier collisions that were determined, performing behavior analytics on the short identifiers to generate behavioral models associated with the short identifiers, and generating adjusted behavioral models based on the average percentage of identifier collisions.

BACKGROUND

Behavioral analytics is a mathematical approach to analyzing patterns in historical data and predicting future actions for the same or similar entities associated with the historical data. For example, behavioral analytics can be used in application development to determine how users will use an application and to predict future usage and preferences, in distributed computer systems to predict usage trends and traffic loads on individual devices, in security systems to detect compromised credentials and threats by locating anomalous behavior, and in ecommerce and retail systems to recommend products, predict future sales, and adjust prices to increase revenue.

Behavioral analytics may be performed using massive amounts of historical event data. The historical event data can include identifiers that identify or distinguish between individual entities (e.g., devices, users, companies, etc.), and behavioral analytics systems can use the identifiers to determine patterns and predict future actions of individual entities. The use of short identifiers can reduce storage requirements for the historical event data, as well as increase processing speeds of behavioral analytics devices and transmission speeds in a distributed behavioral analytics system. Additionally, some historical event data sets may utilize short identifiers to provide a measure of security for the entities associated with the historical event data. For example, credit card transaction histories can be used to determine user patterns and predict future transactions, but the entire credit card number is private information and may not be available in the historical event data.

Short identifiers can be short strings or short integers that are used as identifiers. Short integers can be stored as a “short” integer variables in programming languages such as C, C++, C#, and Java™, and a “short” integer variable may be 16 bits or less in size, instead of, for example, a regular or “long” integer variable that may be 32 bits or more. Accordingly, more short identifiers can be stored in less space, short identifiers can be processed faster, and/or short identifiers can be transmitted faster than, for example, identifiers that use regular or “long” integers or long strings.

However, the use of short identifiers can result in complications in matching historical event data to specific entities. For example, if the short identifiers are short integers with a maximum of four digits, then the maximum number of unique identifiers is 10,000. If there are more than 10,000 entities represented in the historical event data, then identifier collisions can occur where at least two entities are represented by the same identifier. Generally, identifier collisions can result in less accurate pattern recognizing and/or future predictions.

Accordingly, there is a need for systems and methods for performing behavioral analytics using short identifiers, while mitigating the effect of identifier collisions.

SUMMARY

The present disclosure relates to systems, devices, non-transitory computer-readable media, and methods for performing behavioral analytics using short identifiers by obtaining historical event data for multiple events and multiple short identifiers, where each event is associated with a short identifier, selecting a first event location associated with a first event, selecting a second event location associated with a second event that is geographically dispersed from the first event location, determining identifier collisions based on short identifiers associated with the first event location and short identifiers associated with the second event location in the historical event data, determining an average percentage of identifiers collisions based on the identifier collisions that were determined, performing behavior analytics on the short identifiers to generate behavioral models associated with the short identifiers, and generating adjusted behavioral models based on the average percentage of identifier collisions.

In some embodiments the behavioral analytics can be used to determining new event values by obtaining a short identifier from a user device, determining, based on the short identifier, an adjusted behavioral model, determining an expected event frequency based on the adjusted behavioral model, determining an actual event frequency, and transmitting an indication of a new event value to the client device based on a determination that the expected event frequency is less than the actual event frequency, where the client device displays a notification corresponding to the indication of the new event value.

DESCRIPTION OF DRAWINGS

FIG. 1 depicts an example of a system environment, in accordance with some examples of the present disclosure.

FIG. 2 depicts an example of a flow diagram of a process for performing a collision decay analysis, in accordance with some examples of the present disclosure.

FIG. 3 depicts an example of a flow diagram of a process for performing a cohort analysis, in accordance with some examples of the present disclosure.

FIG. 4 depicts an example of a flow diagram of a process for performing a cohort analysis, in accordance with some examples of the present disclosure.

FIG. 5 depicts an example of a flow diagram of a process for obtaining a new event value based on a probability of a future event, in accordance with some examples of the present disclosure.

FIG. 6 depicts an example of a flow diagram of a process for determining an event value based on an identifier without a history, in accordance with some examples of the present disclosure.

FIG. 7 depicts an example of a flow diagram of a process for determining an event value based on an identifier with a history, in accordance with some examples of the present disclosure.

FIG. 8 is a diagram illustrating an example of a hardware system for performing and using behavioral analytics, consistent with certain disclosed embodiments.

DETAILED DESCRIPTION

The following detailed description is provided with reference to examples of embodiments for ease of description and understanding. The present teachings are not limited to the disclosed embodiments, and encompass other variations that fall within the overall scope of description provided herein.

As noted above, a behavioral analytics system can process historical event data that uses short identifiers to identify specific entities. The processing can result in behavioral models that are specific to the historical event data and/or specific to individual entities. As discussed above, the use of short identifiers can allow for behavioral analytics using data where only short identifiers are available (e.g., credit card transaction histories that include only the last four digits of each credit card number), can reduce storage requirements for the historical event data, can improve processing speeds of behavioral analytics devices, can improve transmission speeds in a distributed behavioral analytics system, and can provide additional technical benefits.

In some embodiments, a collision decay analysis can be performed using historical event data to compute an average percentage of identifier collisions between disjoint events in the historical raw data. As used herein, disjoint events are events where a probability that the same entity would be associated with both events is very low (e.g., less than 1% or less than 5%). For example, disjoint events can be events that occur at geographically dispersed locations (e.g., an event that occurs in California and an event that occurs in Virginia).

Because disjoint events are unlikely to be associated with the same entity, even when the same identifier is associated with the disjoint events, that identifier is likely to be representing two different entities (i.e., an identifier collision). In a sample data set, the percentage of identifiers associated with or representing two different entities can be used to compute an average percentage of identifier collisions, as discussed in further detail below.

The average percentage of identifier collisions can then be accounted for in a cohort analysis performed using the historical event data to estimate future actions and assign new event values, as also discussed in further detail below.

FIG. 1 depicts an example of a system environment 100 in accordance with examples of the present disclosure. As shown in FIG. 1, the system environment 100 includes an administrative server system 102. The server system 102 includes a previous history processor 104 and a value generator 106. The server system 102 may further include, or be communicatively linked to, a history storage 108.

In various embodiments, the server system 102 can access historical event data from the history storage 108 and process the historical event data using the previous history processor 104. The processed historical event data can be used to generate new event values using the value generator 106. As used herein historical event data can include, among other things, dates and times of events, locations of events, short identifiers of entities associated with the events, values of the events, identifiers of types of the events, etc. One example of historical event data is records of purchases made with credit cards, where the credit card number is represented by only its last four digits.

The system environment 100 further includes a provider device 120 and a provider device 122. The provider devices 120 and 122 are devices that receive and transmit historical event data and/or new event data. For example, an owner of a gas station that includes a convenience store may operate point-of-sale systems represented by the provider device 120 and the provider device 122, where the provide device 120 collects event input for inside purchases at the convenience store and the provider device 122 collects event input for gasoline sales. As shown in FIG. 1, the provider device 120 and the provider device 122 can communicate with the server system 102 via a network 114.

In some embodiments, the provider devices 120 and 122 may access a graphical user interface provided, for example, by the server system 102, in order to provide event data to and receive new event values from the server system 102. Using the graphical user interface, information may be provided to the server system 102 regarding the provider devices 120 and 122, such as, for example, a name and address of the location of the provider devices 120 and 122, a number and location of point-of-sale terminals at the location, previous event information (e.g., credit card transaction logs), products and/or services that are associated with events at the location, prices for products and/or services that are offered at the location, volumes of sales of products and/or services at the location, baseline values to be achieved for sales of products and/or services, expected revenue growth at the location, and profit margin information for products and/or services that are offered at the location.

The system environment 100 may further include a client device 110 and a client device 112. The client devices 110 and 112 may be operated by entities that are associated with events at locations associated with the provider devices 120 and 122, such as smart phones operated by customers that purchase gas and snacks at the gas station/convenience store. The client devices 110 and 112 may receive, via a graphical user interface, new event information (e.g., a new event value) regarding a product or service associated with a location. In various implementations, the new event information may be, for example, an electronic coupon for a discount on a product or service associated with a location. This information may be transmitted from the server system 102 to the client devices 110 and 112. Similarly, the client devices 110 and 112 may provide information to the server system 102, such as a current location, an identifier (e.g., a short identifier) associated with the device or a user of the device, etc.

It may be appreciated that additional provider devices, client devices, and server systems may reside within the system environment 100.

FIG. 2 depicts an example of a flow diagram of a process for performing a collision decay analysis, in accordance with some examples of the present disclosure. The process depicted in FIG. 2 may be performed using a computing device such as, for example, the server system 102 (e.g. using the previous history processor 104) as depicted in FIG. 1.

The process can begin in 200, when the computing device receives historical event data that includes event data associated with multiple identifiers and multiple locations. The identifiers can be short identifiers, such as, for example, four digit integers.

In some embodiments, other data that can be included in the historical event data can include name and address information for the event location, an identifier of a point-of-sale terminal at the event location, etc.

In 210, the computing device can select a first location from among the multiple locations in the historical event data. In some embodiments, the first location can be selected randomly. In other embodiments, the first location can be selected programmatically (e.g., the location associated with a first event sequentially in the historical event data, a non-centralized location, etc.). In further embodiments, the first location can be user-selected.

In 220, the computing device can select a second location from among the multiple event locations in the historical event data, where the second location is geographically dispersed from the first location. In some embodiments, the second location can be selected randomly based on certain criteria (e.g., selected randomly from among locations that are greater than a threshold distance away from the first location, such as, for example, more than 1,000 kilometers). In other embodiments, the second location can be selected programmatically (e.g., the location associated with a first event sequentially in the historical event data that is greater than a threshold distance away from the first location, etc.). In further embodiments, the second location can be user-selected. However, an error message may be sent to the user if the user-selected second location is closer than a threshold distance away from the first location.

In 230, the computing device can determine an average percentage of identifier collisions by determining the number of identical short identifiers that occur at the first location and at the second location.

For example, the events can represent purchases from a first convenience store at a first location and a second convenience store at a second location that is greater than 1,000 kilometers away from the first location. The historical event data can be a history of credit card transactions at the first and the second location, and the short identifier can be the last four digits of the credit card numbers. Generally, there is a low probability (e.g., less than 1%) that the owner of the same credit card would make a purchase at the first convenience store and a purchase as the geographically dispersed second convenience store within a limited time window (e.g., one month). Therefore, if the same last four digits represent events that occurred at both the first and the second convenience store, these events can be identified or labelled as an identifier collision. The number of identifier collisions can be compared to (e.g., divided by) the total number of unique identifiers at one or both locations to determine or calculate the average percentage of identifier collisions.

In some embodiments, 200-230 can be performed multiple times for multiple sets of geographically dispersed locations. The average percentage of identifier collisions from multiple iterations of 200-230 using the multiple data sets can be aggregated and/or averaged to determine a global average percentage of identifier collisions.

In other embodiments, 200-230 can be performed multiple times for multiple sets of geographically dispersed locations with different population sizes. The population size can represent the number of unique entities associated with events at an event location during a time window. The average percentage of identifier collisions from multiple iterations of 200-230 using the multiple data sets can be aggregated and/or analyzed to determine an average percentage of identifier collisions based on the population size associated with the location.

In further embodiments, 200-230 can be performed multiple times for multiple time windows. For example, the first iteration can be for data within a first time window (e.g., a month) and the second iteration can be for data within a second time window (e.g., a subsequent month). The data from the multiple iterations can be aggregated and/or averaged to determine an average percentage of identifier collisions.

In various embodiments, the average percentage of identifier collisions, the global average percentage of identifier collisions, and/or the average percentage of identifier collisions based on the population size can be used to mitigate the effects of identifier collisions in behavioral analytics, as discussed in further detail below.

FIG. 3 depicts an example of a flow diagram of a process for performing a cohort analysis, in accordance with some examples of the present disclosure. The process depicted in FIG. 3 may be performed using a computing device such as, for example, the server system 102 (e.g. using the previous history processor 104) as depicted in FIG. 1.

The process can begin in 300, when the computing device receives historical event data that includes event data associated with a location, multiple identifiers, and multiple event types, and obtains a base time window for initiating the analysis. The identifiers can be short identifiers, such as, for example, four digit integers. The base time window can be a time window of any length with a specific starting time. For example, the base time window can be a time period of one month that starts on the first day of a specific calendar month (e.g., Jan. 1, 2017 at 12:00 AM).

In 310, the computing device can receive a fixed time window length for performing a cohort analysis. In some embodiments, the fixed time window length can be a multiple of the length of the base time window received in 300. For example, if the base time window length is one month, the fixed time window length can be three months.

In 320, the computing device can begin a first iteration of 320-345 for a first identifier. The computing device can obtain a first identifier associated with at least one event in the historical event data. The first identifier can be obtained by, for example, obtaining an identifier associated with a first event sequentially and/or chronologically in the historical event data, obtaining an identifier associated with a random event in the historical event data, obtaining a user-selected identifier, etc.

In 330, the computing device can assign the identifier to a cohort associated with a first event type. In some embodiments, each event type can be associated with multiple cohorts, and each cohort can represent, for example, a number of times that an identifier is associated with the event type during the base time window. As an example, the base time window can be Jan. 1, 2017-Jan. 31, 2017 and the event type can be a gasoline purchase at a gas station/convenience store. This event type can be associated with six cohorts representing zero, one, two, three, four, and five or more gasoline purchase events by a single identifier during the base time window. Accordingly, if the identifier obtained is associated with one gasoline purchase event during the base time window, the identifier can be assigned to the cohort associated with one gasoline purchase.

In various embodiments, each event type can be associated with a table, a database, or other data structure the represents the cohorts and the associated with identifiers. As noted previously, in various embodiments, the use of short identifiers provides a technical improvement by allowing the system to store the data structure in less space and to process it more efficiently than would be possible using full-length identifiers, such as 16-digit credit card numbers.

In 340, the computing device can assign the identifier to a cohort associated with a second event type that is different from the first event type but is associated with the same location (e.g., a gas station/convenience store). For example, if the first event type is a gasoline purchase, the second event type can be a convenience store purchase. In some embodiments, the identifier can be assigned to a cohort based on a number of times the identifier is associated with a specific event type (e.g., the second event type) during the base time window.

In various embodiments, the first event type and the second event type can be correlated with each other because the first event type and the second event type are associated with the same location (e.g., a gas station/convenience store).

In 345, the computing device can determine if there are additional identifiers that have not been assigned to cohorts for the first and second event types. If there are additional identifiers (345; YES), the computing device can begin a subsequent iteration of 320-345 by obtaining a subsequent identifier associated with at least one event in the historical event data. The subsequent identifier can be obtained by, for example, obtaining an identifier associated with a next event sequentially and/or chronologically in the historical event data, obtaining an identifier associated with a random event in the historical event data, obtaining a user-selected identifier, etc. Then the computing device can perform 330 and 340 for the subsequent identifier.

In some embodiments, 320-345 can be performed for each identifier in the historical event data, and/or until, in 345, the computing device determines that there are no additional identifiers in the historical event data (345; NO). Then the computing device can proceed to 350.

In 350, the computing device can perform a behavioral analytics process on each cohort and event type to predict future behavior based on the historical event data for the base time window. The behavioral analytics process can be performed using, for example, a computer learning behavioral analytics algorithm. In various embodiments, the results of the behavioral analytics process can be behavioral models for each cohort and event type.

In some embodiments, the computer learning behavioral analytics algorithm can be performed and/or the behavioral models can be adjusted using an average percentage of identifier collisions (e.g., as determined in FIG. 2). Accordingly, if the historical event data uses short identifiers, the identifier collisions can be accounted for and mitigated against.

In some implementations, a cohort analysis can be performed for a time window subsequent to the base time window in the fixed time window, while, in further implementations, cohort analyses can be performed for the time window subsequent to the base time window, as well as for each other time windows in the fixed time window.

A single iteration of 320-350 can represent a cohort analysis for a single event type. In some embodiments, the cohort analysis can additionally be for a single location. In further embodiments, additional iterations of 320-350 can be performed for different event types at the same location and/or for the same or different event types at different locations. Cohort analyses for different event types (e.g., gasoline purchases, convenience store purchases, automotive services, car washes, etc.) at the same location can be used for obtaining new event values, as described, for example, in FIGS. 6-7.

FIG. 4 depicts an example of a flow diagram of a process for performing a cohort analysis, in accordance with some examples of the present disclosure. The process depicted in FIG. 4 may be performed using a computing device such as, for example, the server system 102 (e.g. using the previous history processor 104) as depicted in FIG. 1.

The process can begin in 400, when the computing device performs a cohort analysis for a base time window (e.g., similar to that described above with regard to FIG. 3).

In 410, the computing device can assign a subsequent time window (e.g., the next time period in chronological sequence and/or as received in 310 in FIG. 3) as the new base time window and the fixed time window can be correspondingly adjusted. For example, if the base time window is January 2017 and the fixed window is three months (i.e., January 2017-March 2017), the new base time window can be February 2017 and the adjusted fixed window can be February 2017-April 2017.

In 420, the computing device can perform the cohort analysis (e.g., similar to the cohort analysis described above with regard to FIG. 3) but with the new base time window as the base time window. 410-420 can be repeated multiple times. Accordingly, cohort analyses can be performed multiple times over a rolling sequence of time windows in order to analyze the predictability of event types over a longer course of time. Thus, a more accurate behavioral analytics process can be performed and future behavior can be derived or predicted by taking, for example, the average, minimum, and maximum values from across each of the multiple cohort analyses.

In various embodiments, a cohort analysis can be used to compute probabilities of an identifier being associated with an event having a specific event type during a specified future or current time window. Because the identifier is associated with a cohort, the behavior associated with all identifiers in the cohort can be used to model future behavior, resulting in a more accurate behavior model that can be used to compute the probabilities of a specific event occurring in the future.

FIG. 5 depicts an example of a flow diagram of a process for obtaining a new event value based on a probability of a future event, in accordance with some examples of the present disclosure. The process depicted in FIG. 5 may be performed using a computing device such as, for example, the server system 102 (e.g. using the previous history processor 104) as depicted in FIG. 1.

In various embodiments, the process depicted in FIG. 5 can be initiated based on receiving an indication that a new event at a specific location has occurred. In other embodiments, the process can be initiated automatically or based on user input. In further embodiments, the process can be initiated based on an indication that a client device (e.g., one of client device 110 and 112 shown in FIG. 1) is near the specific location. For example, the client device can be a smartphone that determines its current position using a Global Positioning System (GPS) receiver, and the client device can send the current position and an indicator of an identifier associated with the client device to the computing device (e.g., to the server 102).

In 500, the computing device can obtain a probability associated with an identifier, an event type, and a location. For example, if the computing device received an indication that a client device associated with an identifier is near the location, the computing device can obtain a probability associated with the identifier, an event type, and the location.

In some embodiments, the probability can be obtained based on an individual identifier historical event analysis. In such embodiments, a probabilistic model can be used to describe a baseline of expected behavior specific to each identifier. The history for a location is analyzed over a fixed historical period (e.g., three months), and the average number of actual events of the event type at the location made by each unique identifier is computed over a fixed frequency time span (e.g., four weeks) during the fixed historical period. The average expected number of future events of the event type at the location for a unique identifier over the fixed historical period for the frequency time span can then be modeled. For example, the average expected number of future events can be modeled using the formula:

${{expected}\mspace{14mu} {number}\mspace{14mu} {of}\mspace{14mu} {future}\mspace{14mu} {events}_{f}^{ID}} = \frac{\sum_{i = 0}^{n}{{actual}\mspace{14mu} {events}_{i}^{ID}}}{n}$

where f is the frequency time span, ID is the identifier, n is the number of time spans of frequency f in the fixed historical period, actual events_(i) ^(ID) is the actual number of events associated with the identifier during the ith frequency time span in the fixed historical period, and expected number of future events_(i) ^(ID) is the average number of expected events that the identifier is expected to be associated with in a time period off.

In 510, the computing device can determine a minimum value for the event type at the location. In various embodiments, the minimum value can be a value that is specified by the owner of the location. For example, if the event type is a gasoline purchase, the minimum value can represent a minimum value at which the owner will sell gasoline (e.g., $2.00 per gallon). In other embodiments, the minimum value can be programmatically determined based on, for example, an indicated margin on average transaction values (e.g., indicated in a cash value or as a percentage of a transaction).

In 520, the computing device can obtain a weighting value for the event type. The weighting value can be a value that allows for the adjustment of the event value based on incentives or disincentives for specific cohorts. In some embodiments, the weighting value can be automatically calculated based on observed changes in event patterns associated with an identifier over time in response to different event values. In other embodiments, the weighting value can be obtained by the owner, and can be different values for different cohorts.

In 530, the computing device can obtain a new event value based on the probability, the minimum value, and the weighting value. For example, the new event value can be computed based on the product of the minimum value, the weighing value, and one minus the probability.

In various embodiments, the new event value can be transmitted to a client device (e.g., one of client devices 110 and 112 shown in FIG. 1). Accordingly, for example, when the client device is determined to be near to a location, the client device can notify the computing device, the computing device can determine or calculate a new event value, the computing device can transmit the new event value to the client device, and the client device can display the new event value to a user (e.g., using a corresponding graphical user interface). For example, the computing device can calculate a new gasoline discount (e.g., $0.40 cents per gallon off) for a station near the client device and send a corresponding electronic coupon to the client device for display to the potential customer.

FIG. 6 depicts an example of a flow diagram of a process for determining an event value based on an identifier without a history, in accordance with some examples of the present disclosure. The process depicted in FIG. 6 may be performed using a computing device such as, for example, the server system 102 (e.g. using the previous history processor 104) as depicted in FIG. 1.

In various embodiments, the process depicted in FIG. 6 can be initiated based on receiving an indication that a new event at a specific location has occurred. In other embodiments, the process can be initiated automatically or based on user input. In further embodiments, the process can be initiated based on an indication that a client device (e.g., one of client device 110 and 112 shown in FIG. 1) is near the specific location. For example, the client device can be a smartphone, the client device can execute an application that determines a current position using a GPS receiver, and sends the current position and an indicator of an identifier associated with the client device to the computing device.

In 600, the computing device can obtain an identifier. For example, if the computing device received an indication of a new event, the computing device can obtain the identifier associated with the new event as well as, for example, an event type and the location of the new event.

In 610, the computing device can determine if the obtained identifier is associated with a previous event of a first event type at the location during a look-back window. For example, the first event type can be a gasoline purchase, and the computing device can determine if the identifier is associated with a previous gasoline purchase during the look-back window (e.g., the previous three months).

In some embodiments, the length of the look-back window can be configured automatically or based on user input. In embodiments where short identifiers are used, the longer the length the look-back window, the more identifier collisions are likely to occur. Accordingly, a collision decay analysis can be used to determine the length of the look-back window so that the risk of an identifier collision is minimized (e.g., is less than 40%, 20%, 10%, 5%, or 1%).

If, in 610, the computing device determines that the identifier is associated with a previous event of the first event type at the location during the look-back window (610; YES), the process can proceed to 710, as shown in FIG. 7 (described below).

If, in 610, the computing device determines that the identifier is not associated with a previous event of the first event type at the location during the look-back window (610; NO), the process can proceed to 620.

In 620, the computing device can obtain a total value of events associated with the first event type during a current analysis period (e.g., the current month) and can obtain a baseline value associated with the first event type for an analysis period and can determine if the total value of events exceeds the baseline. In some embodiments, the total value of events associated with the first event type can be a total amount of payments received for a particular event type during the current analysis period (e.g., a total amount paid for gasoline purchases).

In some implementations, the baseline value can be determined using an aggregate historical total event value analysis. For example, from historical event data for a location (e.g., a credit card transaction history), a trend analysis can compute revenue growth for the location over the time period of the transaction history based on all event types at the location. The total number of all types of events at the location and the total event values over a fixed frequency time span (e.g., month-to-month) can inform a model of the actual growth trends of the location, which can be used to predict expected growth for the location. The baseline value can be determined using the expected growth, and, in particular, based on an expected total value of events during the analysis period. In other implementations, the baseline value can be determined based on user input.

If, in 620, the computing device determines that the total value of events does not exceed the baseline (620; NO), the process can proceed to 630.

In 630, the computing device can notify a client device of a new event value. In various embodiments, the client device can be a client device that is associated with the identifier obtained in 600. The client device can be running an application, and the application can automatically display or otherwise notify a user of the new event value (e.g., using a graphical user interface). As an example, the new event value can be a new price for gasoline. The new event value can be determined, in some embodiments, using the process described above with regard to FIG. 5.

If, in 620, the computing device determines that the total value of events does exceed the baseline (620; Yes), the process can proceed to 640.

In 640, the computing device can obtain a total value of events associated with a second event type for the same location as the first event type during a current analysis period (e.g., the current month) and can obtain a baseline value associated with the second event type for an analysis period (e.g., based on an aggregate historical total event value analysis or as input by a user) and can determine if the total value of events exceeds the baseline. In some embodiments, the total value of events associated with the second event type can be a total amount of payments received for a second event type during the current analysis period (e.g., a total amount paid for convenience store purchases).

If, in 640, the computing device determines that the total value of events does not exceed the baseline (640; NO), the process can proceed to 660.

In 660, the computing device can notify the client device of an intermediate event value. The client device can be running an application, and the application can automatically display or otherwise notify a user of the new event value (e.g., using a graphical user interface corresponding to the application). As an example, the intermediate event value can be a new price for gasoline that is lower than a regular price but is higher than a new event value that would otherwise be determined in 630 or 650. The intermediate event value can be determined, in some embodiments, based on the process described above with regard to FIG. 5. For example, the intermediate event value can be a numerical average between the new event value, as determined in FIG. 5, and a regular event value.

If, in 640, the computing device determines that the total value of events does exceed the baseline (640; Yes), the process can proceed to 650.

In 650, in some embodiments, the computing device can notify the client device of a new event value for the second event type. The client device can be running an application, and the application can automatically display or otherwise notify a user of the new event value. As an example, the new event value can be a percentage discount on convenience store purchase. The new event value can be determined, in some embodiments, using the process described above with regard to FIG. 5.

In other embodiments, the computing device can notify the client device of a new event value for another event type, such as the first event type (e.g., a discounted price on gasoline) or event types that are not used in 600-640 but are associated with the same location (e.g., a discounted price on a car wash, automotive services, etc.).

In further embodiments, the computing device may not immediately notify the client device, but may store the new event value to notify the client device at a future time. For example, the next time the identifier associated with the client device is associated with the same location, the notification can be sent to the client device.

In still further embodiments, the computing device can store multiple notifications for the same identifier at the same location, and the values of the notifications can accumulate with each new notification. For example, if a first new event value is a $1.00 discount and a second notification is stored for the same identifier and the same location, the computing device may send a notification of a new event value where the new event value is an increased discount (e.g., a $1.50 or a $2.00 discount). If the discount is not used, the next time a new event value is stored for the same identifier and the same location, the notification sent to the user can include a new event value that is a further increased discount (e.g., a $2.15 or a $3.00 discount).

FIG. 7 depicts an example of a flow diagram of a process for determining an event value based on an identifier with a history, in accordance with some examples of the present disclosure. The process depicted in FIG. 7 may be performed using a computing device such as, for example, the server system 102 (e.g. using the previous history processor 104) as depicted in FIG. 1.

In various embodiments, the process depicted in FIG. 7 can be initiated if, in 610 from FIG. 6, the computing device determines that the identifier is associated with a previous event of the first event type at the location during the look-back window (610; YES), as described above.

In 710, the computing device can determine, for the identifier obtained in 600 in FIG. 6, a corresponding cohort for the first event type and a corresponding cohort for the second type. The corresponding cohorts can be determined based on a previously performed cohort analysis (e.g., as described above and shown in FIG. 3).

Based on the determined cohorts, the computing device can determine an expected frequency of occurrence of each event type for the cohort. In some embodiments, an expected frequency can be determined based on the behavior analytics processes performed on each cohort (e.g., in 350 of FIG. 3).

In 720, the computing device can determine whether an actual first event frequency during a current time window is greater than or equal to an expected frequency. For example, based on previously event data, the computing device can determine that the identifier is associated with one previous event of the first event type during the current time window (e.g., the current calendar month).

If, in 720, the computing device determines that the actual first event frequency is greater than or equal to the expected frequency (720; YES), the process can proceed to 730. In other words, the computing device can determine that the type of event has occurred as many times or more than is expected.

In 730, the computing device can notify a client device of a new event value for the first type of event. In various embodiments, the client device can be a client device that is associated with the identifier obtained in 600. The client device can be running an application, and the application can automatically display or otherwise notify a user of the new event value for the first type of event. As an example, the new event value can be a new price for gasoline. The new event value can be determined, in some embodiments, using the process described above with regard to FIG. 5.

In other embodiments, the computing device can notify the client device of a new event value for another event type, such as the second event type (e.g., a percentage discount on convenience store purchases) or event types that are not used in 710 and 720 but are associated with the same location (e.g., a discounted price on a car wash, automotive services, etc.).

In further embodiments, the computing device may not immediately notify the client device, but may store the new event value to notify the client device at a future time.

In still further embodiments, the computing device can store multiple notifications for the same identifier at the same location, and the values of the notifications can accumulate with each new notification. If the discount is not used, the next time a new event value is stored for the same identifier and the same location, the notification sent to the user can include a new event value that is a further increased discount.

If, in 720, the computing device determines that the actual first event frequency is less than the expected frequency (720; NO), the process can proceed to 740.

In 740, the computing device can determine if an actual second event type frequency during a current time window is greater than or equal to an expected frequency for the identifier and the second event type. For example, based on previously event data, the computing device can determine that the identifier is associated with two previous events of the second event type during the current time window (e.g., the current calendar month).

If, in 740, the computing device determines that the actual second event type frequency is greater than or equal to the expected frequency (740; YES), the process can proceed to 760.

In 760, the computing device can notify the client device of a new event value for the second event type. The client device can be running an application, and the application can automatically display or otherwise notify a user of the new event value for the second type of event. As an example, the new event value can be a percentage discount on convenience store purchases. The new event value can be determined, in some embodiments, using the process described above with regard to FIG. 5.

In other embodiments, the computing device can notify the client device of a new event value for another event type, such as the first event type (e.g., a discounted price on gasoline) or event types that are not used in 710-740 but are associated with the same location (e.g., a discounted price on a car wash, automotive services, etc.).

In further embodiments, the computing device may not immediately notify the client device, but may store the new event value to notify the client device at a future time.

In still further embodiments, the computing device can store multiple notifications for the same identifier at the same location, and the values of the notifications can accumulate with each new notification. If the discount is not used, the next time a new event value is stored for the same identifier and the same location, the notification sent to the user can include a new event value that is a further increased discount.

If, in 740, the computing device determines that the actual second event type frequency is less than the expected frequency (740; NO), the process can proceed to 750.

In 750, the computing device can notify the client device of an intermediate event value. As an example, the intermediate event value can be a percentage discount for a convenience store purchase that is greater than zero but is a lower percentage than a new event value that would otherwise be determined in 760. The intermediate event value can be determined, in some embodiments, based on the process described above with regard to FIG. 5. For example, the intermediate event value can be a numerical average between the new event value, as determined in FIG. 5, and a regular event value.

FIG. 8 is a diagram illustrating an example of a hardware system for performing and using behavioral analytics, consistent with certain disclosed embodiments. An example hardware system 800 includes example system components that may be used. The components and arrangement, however, may be varied.

The computer 801 may include a processor 810, a memory 820, a storage 830, and input/output (I/O) devices (not pictured). The computer 801 may be implemented in various ways and can be configured to perform any of the embodiments described above. In some embodiments, the computer 801 can be a client device such as, for example, a desktop computer, a laptop, a mobile device (e.g., a smartphone or a tablet device), etc. In other embodiments, the computer 801 can be a computing device such as, for example, a database server, a web server, a mainframe computer, etc. For example, the computer 801 can be one or more of the server system 102, the provider devices 120 and 122, and the client devices 110 and 112 in FIG. 1. The computer 801 may be standalone or may be part of a subsystem, which may, in turn, be part of a larger system.

The processor 810 may include one or more known processing devices, such as a microprocessor from the Intel Core™ family manufactured by Intel™, the Phenom™ family manufactured by AMD™, or the like. The memory 820 may include one or more storage devices configured to store information and/or instructions used by the processor 810 to perform certain functions and operations related to the disclosed embodiments. The storage 830 may include a volatile or non-volatile, magnetic, semiconductor, tape, optical, removable, non-removable, or other type of computer-readable medium used as a storage device. In some embodiments, the storage 830 can include, for example, historical event data.

In an embodiment, the memory 820 may include one or more programs or subprograms including instructions that may be loaded from the storage 830 or elsewhere that, when executed by the computer 801, perform various procedures, operations, or processes consistent with disclosed embodiments. For example, the memory 820 may include a behavioral analytics program 825 for performing collision decay analyses, performing cohort analyses, obtaining new event values, communicating information to other devices, providing a graphical user interface, notifying users of new event values or incremental event values, etc., according to various disclosed embodiments. The memory 820 may also include other programs that perform other functions, operations, and processes, such as programs that provide communication support, Internet access, etc. The behavior analytics program 825 may be embodied as a single program, or alternatively, may include multiple sub-programs that, when executed, operate together to perform the function of the behavior analytics program 825 according to disclosed embodiments. In some embodiments, the behavior analytics program 825 can perform all or part of the processes of FIGS. 2-7, described above.

The computer 801 may communicate over a link with a network 840. For example, the link may be a direct communication link, a local area network (LAN), a wide area network (WAN), or other suitable connection. The network 840 may include the internet, as well as other networks, which may be connected to various systems and devices.

The computer 801 may include one or more input/output (I/O) devices (not pictured) that allow data to be received and/or transmitted by the computer 801. I/O devices may also include one or more digital and/or analog communication I/O devices that allow the computer 801 to communicate with other machines and devices. I/O devices may also include input devices such as a keyboard or a mouse, and may include output devices such as a display or a printer. The computer 801 may receive data from external machines and devices and output data to external machines and devices via I/O devices. The configuration and number of input and/or output devices incorporated in I/O devices may vary as appropriate for various embodiments.

Example uses of the system 800 can be described by way of example with reference to the embodiments described above.

While the teachings have been described with reference to the exemplary embodiments thereof, those skilled in the art will be able to make various modifications to the described embodiments without departing from the true spirit and scope. The terms and descriptions used herein are set forth by way of illustration only and are not meant as limitations. In particular, although the method has been described by examples, the steps of the method may be performed in a different order than illustrated or simultaneously. Furthermore, to the extent that the terms “including”, “includes”, “having”, “has”, “with”, or variants thereof are used in either the detailed description and the claims, such terms are intended to be inclusive in a manner similar to the term “comprising.” As used herein, the term “one or more of” with respect to a listing of items such as, for example, A and B, means A alone, B alone, or A and B. Those skilled in the art will recognize that these and other variations are possible within the spirit and scope as defined in the following claims and their equivalents. Other embodiments of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the disclosure disclosed herein. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims. 

What is claimed is:
 1. A system comprising: a processing system of a device comprising one or more processors; and a memory system comprising one or more computer-readable media, wherein the one or more computer-readable media contain instructions that, when executed by the processing system, cause the processing system to perform operations comprising: obtaining historical event data comprising a plurality of events and a plurality of short identifiers, wherein each event of the plurality of events is associated with a short identifier of the plurality of short identifiers; selecting a first event location associated with a first event of the plurality of events; selecting a second event location associated with a second event of the plurality of events that is geographically dispersed from the first event location; determining identifier collisions based on short identifiers associated with the first event location and short identifiers associated with the second event location in the historical event data; determining an average percentage of identifier collisions based on the identifier collisions that were determined; performing behavior analytics on the plurality of short identifiers in the historical event data to generate behavioral models associated with the plurality of short identifiers; generating adjusted behavioral models based on the average percentage of identifier collisions; obtaining a short identifier from a user device; determining, based on the short identifier from a user device, an adjusted behavioral model of the adjusted behavioral models; determining an expected event frequency based on the adjusted behavioral model; determining an actual event frequency; and transmitting an indication of a new event value to the client device based on a determination that the expected event frequency is less than the actual event frequency, whereby the client device displays a notification corresponding to the indication of the new event value.
 2. The system of claim 1, wherein the plurality of short identifiers are integers.
 3. The system of claim 2, wherein the integers are stored as short integer variables.
 4. The system of claim 2, wherein the integers are four digit integers.
 5. The system of claim 1, wherein the determining the average percentage of identifier collisions comprises determining identifier collisions between multiple sets of geographically dispersed event locations.
 6. The system of claim 1, wherein performing the behavior analytics comprises performing a cohort analysis for a base time window by assigning each identifier of the plurality of identifiers to a cohort of a plurality of cohorts associated with an event type based on a number of events associated with the identifier and the event type.
 7. The system of claim 6, wherein the behavior analytics are performed on each cohort of the plurality of cohorts.
 8. The system of claim 6, wherein performing the behavior analytics further comprises: assigning a subsequent time window as a new base time window; and performing a cohort analysis for the new base time window.
 9. The system of claim 1, the operations further comprising determining the new event value based on an actual number of events, an expected number of events, a minimum value, and a weighting value.
 10. The system of claim 1, the operations further comprising: obtaining a second short identifier from a second user device; determining that the second short identifier is not one the plurality of short identifiers; and transmitting an indication of a second new event value associated with an event to the second client device based on a determination that a total value of the event is less than a baseline value of the event.
 11. A method comprising: obtaining historical event data comprising a plurality of events and a plurality of short identifiers, wherein each event of the plurality of events is associated with a short identifier of the plurality of short identifiers; selecting a first event location associated with a first event of the plurality of events; selecting a second event location associated with a second event of the plurality of events that is geographically dispersed from the first event location; determining identifier collisions based on short identifiers associated with the first event location and short identifiers associated with the second event location in the historical event data; determining an average percentage of identifier collisions based on the identifier collisions that were determined; performing, using one or more processors, behavior analytics on the plurality of short identifiers in the historical event data to generate behavioral models associated with the plurality of short identifiers; generating adjusted behavioral models based on the average percentage of identifier collisions; obtaining a short identifier from a user device; determining, based on the short identifier from a user device, an adjusted behavioral model of the adjusted behavioral models; determining an expected event frequency based on the adjusted behavioral model; determining an actual event frequency; and transmitting an indication of a new event value to the client device based on a determination that the expected event frequency is less than the actual event frequency, whereby the client device displays a notification corresponding to the indication of the new event value.
 12. The method of claim 11, wherein the plurality of short identifiers are integers.
 13. The method of claim 12, wherein the integers are stored as short integer variables.
 14. The method of claim 12, wherein the integers are four digit integers.
 15. The method of claim 11, wherein the determining the average percentage of identifier collisions comprises determining identifier collisions between multiple sets of geographically dispersed event locations.
 16. The method of claim 11, wherein performing the behavior analytics comprises performing a cohort analysis for a base time window by assigning each identifier of the plurality of identifiers to a cohort of a plurality of cohorts associated with an event type based on a number of events associated with the identifier and the event type.
 17. The method of claim 16, wherein the behavior analytics are performed on each cohort of the plurality of cohorts.
 18. The method of claim 16, wherein performing the behavior analytics further comprises: assigning a subsequent time window as a new base time window; and performing a cohort analysis for the new base time window.
 19. The method of claim 11, the operations further comprising determining the new event value based on an actual number of events, an expected number of events, a minimum value, and a weighting value.
 20. The method of claim 11, the operations further comprising: obtaining a second short identifier from a second user device; determining that the second short identifier is not one the plurality of short identifiers; and transmitting an indication of a second new event value associated with an event to the second client device based on a determination that a total value of the event is less than a baseline value of the event. 